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Abstract. In this paper we define and study a generalized version of facility location problem in 
which facility cost functions depend on the number of clients assigned to the facility. There is an 
associated cost function for each facility that depends on the number of clients assigned to it. We 
focus on the case of concave facility cost functions, and present greedy 1.94 and 1.52 approximation 
algorithms for this case. We will also consider various generalizations and variants of the problem and 
give an O(log n) approximation algorithm for the non-metric generalized facility location problem. 


1 Introduction 


The facility location problem is a central problem in operation research. In this problem, we have a set of 
clients and a set of facilities, and we want to connect each client to a facility in a way that minimizes the 
total cost. There are two types of costs associated with a solution: the connection costs and the facility 
costs. The connection cost between each client j and facility 7 is a number c;; that is given. We will have 
to pay this amount if we want to connect client 7 to facility 7. Each facility 7 also has a facility cost f;, that 
is the amount we have to pay if we decide to connect at least one client to it. The cost of a solution is the 
summation of the connection costs and the facility costs associated with the solution. We will generally 
assume that the connection costs obey the metric inequality. 


The facility location problem is well-studied in the field of approximation algorithms, and a number of 
different approximation algorithms have been proposed for this problem using a variety of techniques [11, 
14, 6,7, 10, 8, 1-5, 8, 9, 13]. 

A natural generalization of the facility location problem is to allow the facility cost of a facility to be an 
arbitrary concave function of the number of cities connected to it. This question was first asked by Tom 
Leighton and was motivated by its applications in placing servers on the Internet. The reason it is assumed 
that the facility cost function is concave is the following principle of economics: As the number of clients 
increases, the cost per client will decrease, since they share some common expenses. 


The main result of this paper is a 1.94 approximation algorithm for the concave facility location problem. 
Our algorithm is a natural generalization of an algorithm by Jain et al. [7]. The analysis uses the techniques 
of dual-fitting and factor-revealing LPs introduced in [6]. We will also consider more general versions of 
the problem by relaxing the conditions of concavity for facility costs and metricity for connection costs. 


This paper is organized as follows. In section 2, we give a formal definition of the problem, and observe 
that it can be reduced to the capacitated facility location problem with hard capacities. In section 3, we 
present a greedy 1.94 approximation algorithm for the concave facility location problem. In section 4, we 
prove the approximation factor 1.94 for our greedy algorithm. In sections 5 and 6, we study generalizations 
and variants of the problem such as allowing the facility cost function to be slightly non-concave, facility 
location with convex cost functions, and relaxing the metric inequality for the connection costs. 


2 The problem 


The generalized facility location problem is define as follows. We are given a set F of facilities (a.k.a servers); 
aset D of clients (a.k.a. cities or demands); a facility cost function f; : N — N for every facility i € F, which 
specifies the cost of the facility as a function of the number of clients served by it; and finally connection 
costs cj; between facility i and client j. We generally assume that the connection costs are metric (i.e., is 


symmetric and satisfies the triangle inequality), unless otherwise stated. The objective of the problem is 
to find an assignment w of all clients to facilities (i.e., yw : D+ F) with minimum total cost. The total 
cost is the sum of connection costs (S)j;ep Cy(j),j) and facility costs (Wee fi(l{7 € D: YG) = t}I))- 


The focus of this paper is on the concave facility location problem, i.e., the generalized facility location 
problem with a guarantee that all facility cost functions are concave. Recall that a function f: N > N is 
called concave if and only if for each « > 1, f(a +1) — f(x) < f(x) -— f(x—1). 


We observe that the concave facility location problem can be reduced to the capacitated facility location 
with hard capacities. In the capacitated facility location problem, each facility i also has a capacity uj, 
which is the maximum number of clients that can be served by this facility. The problem has two variants; 
in the first one, the capacities are hard in that facility 1 can be opened at most once to serve at most u; 
demands; in the second variant the capacities are soft; that is, facility i may be opened k times to serve up 
to ku; demands at a cost of kf;. It is easy to observe that both of these problems are special cases of the 
generalized facility location problem. The capacitated facility location problem with soft capacities can be 
reduced to the uncapacitated facility location problem [11], and the best known approximation algorithm 
for this problem [11] is obtained via this reduction. However, hard capacities are known to be problematic 
for techniques that rely on linear programs (such as LP-rounding and primal-dual algorithms). The main 
problem is that natural linear program has a large integrality gap for the case of hard capacities. Recently, 
Pal et al. [12] used local search techniques to give the first constant factor approximation algorithm for 
this problem, achieving a factor of 9 + «. However, the running time of this algorithm is prohibitive for 
applications with large data sets. 


Proposition 1. Assume there is a polynomial time a-approximation algorithm for the capacitated facility 
location problem with hard capacities. Then the concave facility location problem can be approximated within 
a factor of a in polynomial time. 


Proof. For each facility i € F, we place n copies of this facility, where the 7’th copy has opening cost 
fi(j) and capacity 7. We use the a-approximation algorithm for the capacitated facility location problem 
to solve this instance. Since the facility cost function is concave, we can assume without loss of generality 
that at most one of these n copies is opened in this solution. Therefore, the solution to the capacitated 
facility location instance can be easily transformed to a solution for the concave facility locatino problem 
with the same total cost. 


The above proposition together with the algorithm of Pal et al. [12] gives a 9+ «approximation algorithm 
for the concave facility location problem. In the next section, we will show a greedy algorithm for this 
problem achieving a factor of 1.94. 


3 The Greedy Algorithm 


In this section, we present the main result of this paper that is a greedy 1.94-approximation algorithm 
for the concave facility location problem. The approximation factor of this algorithm is much better than 
the algorithm mentioned in the proof of Theorem 1. In addition, since the previous algorithm uses the 
approximation algorithm of the capacitated facility location problem with hard capacities, and the current 
algorithm for this problem uses the local search method, its running time is very high, especially because 
we reduce an instance of concave facility location with O(n) facilities to an instance of capacitated facility 
location with O(n?) facilities. Our method is an extension of the method used by Jain et al. [7] and 
Mahdian et al. [10]. First, we define stars. 


Definition 1. A star consists of one facility and several chents. For star S, consisting of clients a1, a2,..., Gr 
and facility p, cost of S, denoted by c(S) or cg, is defined as yy Cp,a; + fp(k) t.€., the sum of the con- 
nection costs of clients to the facility p plus the facility cost function of p for k clients. 


Suppose an algorithm finds a solution of cost @ to the concave facility location problem, and also it finds 
values aj; for every client 7 € C as the contribution of client j in the total cost. In addition, assume 


eC aj = 6 and there is a constant y > 1 such that for every star S, DyesnC aj < yes. We now consider 
an optimal solution OPT to the problem. Let i be a facility that is opened in OPT. For set D of clients that 
are connected to facility 7 in OPT, we can write jp aj < V(fit Vjcp Cj) © Vjep aj < yes where S 
is the star consisting of 1 and D. By summing up the inequalities for every star that is picked in OPT, we 
obtain 6 = eee a <UL seOPT Cs = 7: cost(OPT). Therefore if we can find such an algorithm with a 
constant 7, y is the approximation factor of the algorithm. 


It is worth mentioning that this approach can also be considered using LP-duality. The problem can be 
formalized by an integer linear program based on stars, and the a;’s are the variables of the dual program 
in which we relax inequalities by a constant factor . The reader is referred to Jain et al. [7] to see this 
method called the dual-fitting method in more details. 


In the next section using the approach discussed above, we show that simple greedy Algorithm A pre- 
sented below is a 1.94 approximation algorithm for the concave facility location problem. In this algorithm, 
we use a notion of time (lines 1 and 6), such that every event can be associated with the time at which it 
happened. Also each client 7 has a budget from which it can offer some money to facilities; if j is uncon- 
nected and its budget is more than the cost of the connection to a facility 7, it offers the extra budget to 
i (line 9); and if j is connected to a facility i’, it offers to a facility i the amount by which it can save by 
switching its facility from i’ to i (line 10). We note that at any time, the budget of each connected client 
is equal to its current connection cost plus its total contribution toward open facilities. 


Algorithm A: greedy algorithm for concave facility location 
Input: Metric connection costs c;; for each facility 7 and client j. 
Concave facility cost functions f; : N — N for each facility 7. 
Output: For each client 7, a facility p(j) to which 7 is assigned. 
For each client j, contribution of client j to the total cost (a;). 


begin 

1 lett= 

2 for each facility i let level, = 0 

3 for each client 7 

4 let p(j) =null 

5 let budget(j) = 0 

6 while there is an unconnected client increase time t 

7 for each unconnected client j let budget(j) = t 

8 for each client j 

9 if p(j) =null let offer(j,i) = max(budget(j) — ci;,0) 
10 else let offer(j,i) = max(cp(j); — cij, 0) 

11 while there is a facility 1 and k — level; clients a1,---,4,_Jeyel, (k > level;) 


which contains at least one unconnected client and Mae * offer(a;,7) = fi(k) — fi(level;) 


12 let level; =k 

13 for each 1 < 7 < k — level; let p(a;) =i 

14 for each client j 

15 let a(j) = budget(j), the time that 7 first gets connected 
end 


The proof of the following lemma is clear from the algorithm and the discussion above. 
Lemma 1. The total cost of the solution found by Algorithm A is equal to the sum of a;’s. 


The above algorithm is similar to the greedy algorithm of Jain et al. [7]. The difference is that here we 
define the concept of level for facilities that is the number of clients assigned to it and the events in which 
we assign clients to facilities depend on the level of vertices. 


4 The approximation factor 


In this section, we show that Algorithm A is indeed a 1.94-approximation algorithm for the concave 
facility location problem. We prove this by showing that for each star S, the ratio of the sum of a,;’s of all 


clients contained in S to the total cost of S is at most 7 ~ 1.94. Analogous to the work of Jain et al. [7], 
our approach is as follows. First, based on the behavior of the algorithm we obtain some linear constraints 
called factor revealing-LP on a;’s and the cost of S. Next, we show that for any feasible solution of the LP 
(not necessarily for the one obtained from the algorithm) our objective ratio is at most y ~ 1.94. Here, 
an LP-solver helps us to guess such a ratio, and then using complicated calculations we prove this upper 
bound. 


To derive the factor-revealing LP, first we need some definitions and notations. Consider a star S consisting 
of a facility p and k clients numbered 1 through k. Let dj; denote the connection cost between facility p 
and client j, and a; denote the share of j of the total expenses (see the definition of a in the algorithm). 
The cost of the star is f,(k) + eam d;. For simplicity, we set f = f,(k). Without loss of generality, we 
assume ay < a2 <--- < ag. Let the critical time for a client i be the time just before i gets connected for 
the first time, i.e., when t = a; — € where € is very small. At the critical time for client 1, each of the clients 
1,2,...,7-1 Hehe be connected to a facility. For every j < i, if client 7 is connected to some facility at 
time t, let rj denote the connection cost between this facility and client j; otherwise, let rj; := aj (in 
this case a; = aj). 


First we note that since the budget of a client remains constant when it gets connected to a facility, and it 
may not get connected to another facility with a higher connection cost, 7jj41 > Tjj42 > °° > 75,4. Now 
we obtain more constraints. 


Lemma 2. At the critical time for a client i, for every subset S; C {1,...,i —1} and every subset 
S2 c fi,sicny kh} (Se # 0), 
S5 max(r;,; — dj,0) + $5 max(aj — dj,0) < fp(|S1| + [S2l). (1) 
JES. JES2 


In particular, Sa max(rj,; — d;,0) + ye ;max(a; — d;,0) < fp(k) = f. 

Proof. The amount client j offers to facility f at time t = aj;—€ is max(rj,;—d,;,0) if j < i, and max(t—d;, 0) 
if 7 > 4. By the definition of r;; this holds even if 7 < 7 and a; = aj. From the algorithm, the total offer 
of clients in S| U S_ to facility p may not become larger than the cost of facility p at level || + |So|, 
since otherwise all these clients were assigned to the facility p. Thus, for all 7, }0;-5, max(rj,i — dj,0) + 
Dyes, max(ai — dj,0) < f,(|Si| + |S2|). In particular by setting S; = {1,...,i— 1} and Sp = {i,...,k}, 


yee max(1j,4 — dj, 0) + ee max(a; — dj, 0) <f. Oo 


So far, we have not used the triangle inequality of connection costs and concavity of facility cost functions. 
We use these assumptions in the next lemma. 


Lemma 3. At the critical time for a client i, for all clients 7 such that 1 <j <i, 
aj Sag trie t+ di + dj. (2) 


Proof. Let p' be the facility that 7 is connected to at time t = a; — . By the triangle inequality and 
the definition of r;;, the connection cost c,; between client i and facility p' is at most cp; + dj + dj. It 
is not hard to see that by the definition of rj, we have cp; < rj. Thus these two inequalities imply 
Cyi S774 + di + dj. 

Furthermore, if the level of p' is equal to | when client j gets connected to it, cy»; can not be less than 
t+ fp (l+1) — fp (I), since otherwise the client i should be connected to the facility p’ at a time earlier than 
t, which is a contradiction. This shows that: a; —-€ =t < cy; t+ fp (l+1) — f, (I). From the last inequality 
and this one, a; < rj, +d; +d; + fp (l+1) — fp (1). From the fact that fp is a concave function, it turns 


out that for all gq <1, fy (J+1)— fp (1) < fetta), Now consider the time a; at which j gets connected 
to facility p’. Let g be the level of p' before time a; and at this time J — q new clients, bi, b2,...,by_q are 
assigned to facility p'. At time a; the total amount of these /— clients’ offer to p’ is equal to fp (1) — fp (q). 
The amount of b;’s offer to facility p’ is equal to either aj — Cp,» OF Cb,» — Cb,p/ depends on the situation 


of client b; at time a; (whether it was assigned to a facility p” or not). In either case, b,’s offer is less than 
or equal to a; — Cp,p', thus 


l—q i=9g 
fp (1) = fo (q) = S © offer(bi) < So aj = ( i g)a; => f= tol) < a; 
i=1 i=1 


Combining all these inequalities together, we get the following inequality. For every 1 < j <i < k, 
a, < 754 +d; +d; +a0;. Oo 


The following optimization program, called the factor-revealing LP, can be obtained from the above in- 
equalities. We note that by scaling f + ee d; = 1 and introducing new variables and new constraints for 
function max we can obtain a linear program. 


k 
Dini Vi (3) 
f+ hid 
subject to Vl<i<k: aj < agi 
Vi<jg<i<k: Ti 21 ji41 
Vi<j<isk: aj <ajt+rjitdjtd; 
VI<i<k,& C {l,...,8-1}, 52 C fi,...,k}: 5° max(ry;— dj,0) 
jES1 
+ $5 max(a; — dj,0) < fp(S1| + [S2l) 
JES2 
VI<j<sicsk: aj, dj, f,7j,4 2 0 


maximize 


The size of the above program is large (exponential) because of the forth set of inequalities and it is hard 
to find out the solution of the problem for large k’s. In order to solve this problem, we observed that using 
Lemma 2, we can relax the forth set of inequalities and still get the approximation factor 1.94. 


eae (4) 
f+ iad 
subject to Vl<i<k: aj < agi 
VIi<j<i<sk: ry3>7rji41 
VIi<j<isk: qa<ajtrjyytdtd; 


maximize 


i-1 k 
V1<i<k: So max(rj,; — d;,0) + So max(a; —dj;,0)<f 
j=l j= 


V1 <j < a < k: a5, 45, f, 75, > 0 
After this relaxation, the number of inequalities in the optimization program is polynomial. 


Theorem 1. Let 7 be sup, {zx}, where z, is the solution of the factor-revealing LP. Then Algorithm A 
gives a y-approzimation algorithm for the concave facility location problem. 


Proof. Since the values a;, dj, f and rj; obtained from Algorithm A satisfy all inequalities in the LP, 
the values of the objective function for them is at most z,. It implies for each star S consisting of one 
facility and k clients i1,---, ix, yes a4, is at most z,c¢s. The proof of the lemma follows from this fact 


and Lemma 1. ' Oo 


In the next step, we use an LP-solver like CPLEX to obtain the optimum solution of the factor revealing 
LP for fixed k. The results are shown in Table 1. 


10 {1.83517 
20 |1.88389 


50 {1.91573 
100}1.92652 
200}1.93193 


Table 1. Solutions of the factor-revealing LP 


Using these experimental results one can observe that -y = 1.94; however all of them are just lower bounds 
for the LP. To obtain the desired approximation ratio, we need to prove an upper bound 1.94 on the 
maximum solution of the LP. The proof needs so much calculations, and thus it is presented in Appendix 
A. It is worth mentioning Mahdian et al. [7,11] also use these kinds of tedious calculations for the facility 
location problem; however since our LP is different from theirs, the calculations are different. Finally we 
have the following theorem: 


Theorem 2. Algorithm A is a 1.94-approximation algorithm with running time O(n*) for the concave 
facility location problem, where n = max(nf, Nc). 


The following improvement for the concave facility location problem has been suggested by one of the 
referees of SODA 2003. It is worth mentioning, still we use Algorithm A for more general connection 
costs in Section 6. 


Theorem 3. There exists a 1.52-approximation algorithm for the concave facility location problem. 


Proof. The main idea is that in our problem any concave function f(x) can be represented by min, {(f(k)- 
f(k-1))a+kf(k—-1)—-(k-1)f(k)}, 1 < k < n. Now, we take each facility i with concave cost function fj, 
and replace it by multiple facilities i1, i2,---, i such that facility 4; has opening cost kf;(k-—1)—(k—-1) fi(k). 
In addition, each unit of demand routed to this location costs f;(k) — fi(k — 1) extra at the facility. This 
cost fi(k) — fi(k — 1) can be added to the distance metric. For this facility location problem, Mahdian et 
al. [11] have a 1.52 approximation. 


5 Generalizations 


In this section, we consider more general variants of the problem such as the problem with relaxed metric 
inequality. 


5.1 Other facility cost functions 


So far, we have shown approximation algorithms for the concave facility location problem. In this section, 
we consider the generalized facility location problem with more general facility cost functions. First, we 
give a polynomial time algorithm for the generalized facility location problem with convex facility cost 
functions. Recall that a function f is called convex if for every x > 1, f(x +1) — f(x) > f(x) — f(a —- 1). 


Theorem 4. The generalized facility location problem with convex facility cost functions can be solved in 
polynomial time. 


Proof. First we reduce the problem to the capacitated facility location problem with unit hard capacities. 
Next, we show how this problem can be solved in polynomial time. For each facility 1 € F, we place n 
copies of unit-capacity facilities where f?, the opening cost of the jth facility of the ones corresponding 
to facility 1 € F, is fig +1) -— filj), O < J < n—1. The correctness of the reduction follows from the 
fact that if we use a facility f?, we should use all facilities f*, k < j, since function f; is convex. We now 


solve the problem by minimum weighted matching on bipartite graphs. We construct a bipartite graph 
G = (X UY, E) as follows. For each client 7, we place a vertex in the set X, for each facility 1, we place a 
vertex in the set Y, and finally we place an edge {j,i} in E between a client j and a facility i with weight 
cij + fi. We can easily observe by solving minimum weighted matching [15] for G, one can find an optimum 
assignment for the original problem. Oo 


The problem of finding a constant factor approximation algorithm for general facility cost functions is 
open. Here, we observe that Algorithm A can be used to find constant factor approximation algorithms 
for cost functions that are close to concave functions. 


Definition 2. A function f : N — N is a c-close concave function if there exists a concave function g 
such that Vx EN: a2) < f(x) < g(a). The function f : N - N is c-concave if and only if for alll and q 
such that q <1, we have f(l+1)— f(l) < cl Oalo | 


In the case of c-close concave functions, we have the following simple theorem. 


Theorem 5. There is a constant factor approximation algorithm of factor 1.94c for the generalized facility 
location problem with cost functions that are c-close to concave functions. 


Proof. Consider a function g; such that & < f; < g;. We use Algorithm A to solve the problem for facility 
cost functions g;’s. We know that the cost of this solution is at most 1.94 times the optimal solution for 
facility cost functions g;’s. Using the inequality & < f; < g;, the optimal solution for g;’s is at most c 
times the optimal solution for f;’s. Thus, the approximation factor is 1.94c. 


For c-concave functions, one can observe that by a similar proof of Lemma 3, we can prove that if f;’s 
are c-concave, then for all 1 < j <i < k, aj < caj+17j4 +d; + dj. Therefore, using Algorithm A 
for functions f;’s, the approximation factor of the algorithm is the optimal solution of the same factor 
revealing LP except the third set of inequalities which are replaced by a; < ca; +1; +d; +d,;. Table 2 
shows a summary of the results obtained by solving the factor-revealing LP using CPLEX for k = 100. 
From the experimental results, it turns out that z, depends on c by an asymptotically linear function and 
the approximation factor is again a constant. 


10 [5.98046 


Table 2. Approximation factor for c-concave functions and A-parameterized metric(k = 100) 


5.2 More general connection costs 


It is easy to see that the facility location problem (and therefore the generalized faciltiy location problem) 
is NP-hard to approximate within a factor less than O(Inn) if the connection costs are not metric. Also, 
it is not difficult to see that the classical set cover algorithm can approximate the non-metric concave 
facility location problem within a factor of O(Inn). However, one can observe that Algorithm A works 
very well when the metric inequality is somewhat relaxed. In this case, instead of the triangle inequality 
(AC < AB+BC) a parameterized triangle inequality (AC < \(AB+BC)) is satisfied. It is straightforward 
to restate the proof of Lemma 3 and prove that for all 1 <j <i < k, aj < A(ry + di + dj) +.a;. 
Therefore, we have the same factor-revealing LP except the third set of inequalities which are replaced by 
ay < A(rj¢ +d; +d;) +a;. Using CPLEX, we obtained the optimum solution of this factor-revealing LP for 
k = 100. Table 2 shows the empirical results for different values of A. These results show that Algorithm 
A works well when the metric inequality is somewhat relaxed. 


6 General connection costs 


It is easy to see that the facility location problem (and therefore the generalized faciltiy location problem) 
is NP-hard to approximate within a factor less than O(Inn) if the connection costs are not metric. Also, it 
is not difficult to see that the classical set cover algorithm can approximate the non-metric concave facility 
location problem within a factor of O(Inn). However, this reduction does not work for the non-metric 
generalized facility location problem. For this case, we can prove that Algorithm A has an approximation 
factor of Inn, where n is the number of clients. 


Theorem 6. Algorithm A achieves an approximation factor of Inn for the generalized facility location 
problem when the connection cost is non-metric. 


Proof. Recall the definitions of a;,dj, and f in Section 4. By Lemma 2, we have yee (ai—d;) < f (Notice 
that the concavity assumption was not used in the proof of Lemma 2). Thus, 


k k 
1 1 
bog = i = ay 7 
aS ee = poi t+ 2a) 
It follows that 
k k i k k 
doa s pa = Hi(f + > dj) < (Innies. 
i=1 i=1 fal pan 


The above theorem implies that the capacitated facility location problem can be approximated by a factor 
of Inn. To the best of our knowledge, this is the first approximation algorithm for the (hard) capacitated 
facility location problem when the connection cost is non-metric. Also, it is not difficult to observe that 
if instead of the metric inequality, connection costs satisfy a relaxed version of the metric inequality, then 
by proving a relaxed version of the inequality in Lemma 3 and solving the corresponding factor-revealing 
LP, one can obtain the approximation factor of the algorithm. 
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A Proof of Theorem 3 


In order to prove this upper bound, first we prove the following lemma which allows us to prove the theorem 
on the case of sufficiently large k. The proof of the following lemma is the same as the proof of Lemma 14 
in [7] and hence omitted. 


Lemma 4. If z, denotes the solution to the factor-revealing LP, then for every k, z, < Zap. 


Now, in order to prove the approximation factor, the objective is to combine the inequalities of the Program 
4 to derive an inequality of the form ee an < (f+ ae d;). Such a y will be an upper bound on the 
solution of the Program 4. 


We start by relaxing the forth inequality of the Program 4 to the following inequality 


iy i-1 


So (ai - d;) + So max(rj,; — d;,0) < f, (5) 


j=i j=l 
where as in [6] we define J; as follows 


_ [pk ifi<pik 
u={? if i> pik (6) 


Here p; and po are constants with 0 < py < peo < 1 which will be fixed later in the proof. 
Inequality 5 implies 


l ; 


We multiply both sides of the above inequality by a constant 9; that will be fixed later, and add the 
resulting inequalities. This will imply the following inequality. 


se < : aut i 3 d; - Yeti dj, (8) 


i=1 i=1 * j=i j=l 


pO > ee 
N= eg rea ah (10) 
: ae 
i=pik+1 1; aa if fg pok 
Therefore, Inequality 8 can be written as follows. 
k k k 6 i-1 
> Gi08 < Cf + D> didi -— DO eS] > max(rj,; — dj, 0) (11) 
i=1 i=1 i=1 * j=l 


In the above inequality, some 0;’s are greater than 1 and others are less than or equal to 1. The next step 
is to use the inequality a; < aj +1734 +d; +d; to make the coefficient of all a;’s on the left-hand side of 
the inequality equal to 1. We assume 6;’s are chosen in a way that 


Siaae. (12) 


i=1 
Also, we assume there is a constant p3 such that 
Vi < p3k, 0; >1 and Vi > psk, 8; <1 (13) 


We will make sure that 6;’s satisfy the above constraints when we fix their values later in the proof. Now 
consider the inequality a; < aj +7rj4 +d; +d; for i > p3k and 7 < p3k. Multiply both sides of this 


inequality by a constant w;,; > 0, and add up all these inequalities with Inequality 11. 
If we choose w;;’s in such a way that 


S- wij =6;-1 Vi < psk and S- wy =1-4 Vi > psk (14) 
i>psk Jj<p3k 


we will get the following inequality: 


k 
Paseo +|1-O))di+ S> YO wigrya- do 
t=1 


t>p3k j<p3k 


t- 


1 
yy, a mantrs 4,0) (15) 


Now, we define p4 < p; so that we only use triangle inequalities a; < rj;+aj;+dj+d; fori > pok and pak < 
j <psk or for p3xk <i < pok and j < pk. From this definition of p4, first we need that 


pak pak 


YS 0-4) <0 -Y (16) 


j=psk+1 get 
and also we choose w4j;’s such that 0 < pa < pi and 
wig FO => (i > pok and pak < j < psk) or (p3k <i < pok and j < pak) (17) 


Now, using the fact that rj; > 3:41, we can write Inequality 15 as follows: 


k p3k pok 
yas Cf< yi (Ag+|1—-0;))di+ S- S- Wi gl iat y S- Wi, iV j,i SS 3 es a max(rj,;—d;, 0) 
i=1 t=pok+1 j=pakt+1 i=p3k+1 j<pak j=l t=j+1 
k p3k k pak pak p3k ok 
<2 (Ast|1-AN) d+ D> ripe D> wag)t>orimoel D> 9)-d> i‘ oa rss —d;,0) 
i=1 j=pak+1 i=pok+1 j=l i=p3k+1 j=l i=j+i ” 


Using Equations 14 and 17, it turns out that >> 
for pak <j < p3k. Thus, 


i>pok i,j = 9; —1 for j < pak and Yop ci<por M5 = 95-1 
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p3k pak 


k k 
oa — CF SSO + (L-Oildi+ SP rj,poe(85 — 1) + D2 r5,n0(85 — 1) 
t=1 t=1 


j=pak+1 j=l 
pak k 6; p3k k 6, 
ete a 
= Ds De Tee max(rj,; — dj,0) — . S- pe ea max(rj,; — dj, 0) 
pea a aclek j=pak+1 i=Jj+1 


Notice that for i > pok, 73,4 <1j,p.x and for i > psk , 77,4 < Tj,pax- After substitution of these values in the 
above inequality, we get the following: 


k k psk pak 
Soai— CFS DOOit [1 -Oildit SO rjpow(Oj —1) + D> rips (Oj — 1) 
i=1 i=1 j=paktl j=l 
pak p3k 6, p3k pak 0, 
a a 
—S0 rinse - 49) - SD (ria - 5) 
aa ttt omen 4 
p3k k p3sk pak 6; 
< SOA +26: -VY)di+ SS Oi+1-O)di+ SO rypoe(6;-1- So Ee = 7D 
=1 i=pgk+1 j=pak-+1 i=pi t* 
pak p3k 6; 
a 
+2 tele 1— 2) rai 
j=l i=j+1 


Now, if 0;’s satisfy 


por pig 2 65-1 if § < pak 


pa a (18) 
ars § > 0; -1 if pk <j < pak 
This implies 
k p3k k 
= a t=psk+1 


Now, if we set the coefficients of d;’s in the right-hand side of the above inequality to y, we will get the 
following recurrence for 6;: 


Ni + 2(8; -1) = if i < p3k 


M+1-0Q=7 if i> psk (20) 


Notice that A;’s can be written as a function of @;’s (i < j) from Equation 10, thus the above equations 
are recurrence relations for 6;. 


Solving this recurrence, we get 


(3 — 26p,n) *<28* if pgk <i < pok 
0 if pk <i<k 


If we can set the constants pi, p2,p3,p4 in such a way that 6;’s satisfy the Conditions 12, 13, 16, and 18 
then Inequality 19 shows that the solution of the factor-revealing LP is at most max(¢, 7). 


It’s not hard to see that 0;’s are decreasing from 0 to psk and increasing from p3k +1 to pak, thus in order 
to satisfy Conditions 13, it is sufficient to have: 


Ongk > 1 and Opok < 1 (22) 
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In order to write Conditions 12 and 16, we compute the sum of 6;’s for different intervals separately. 


p3k p3k 
i= (y+2)psk- 25°06; and 
j=l 
p3k p3k Jj p3k p3k p3k 
_ = (p3k — a+ 1) 6; 
di “Dom a gD eres a oS eran l,-i+1 
p3k Pit (pi p3k 
)kO; — 1)k6; 
ee eee 
i i=pikt+1 ”* 
p3k 
= S56; + (ps — po) kApyk + (ps — Lbs — Apik) 
j=1 
p3k 
> DE Me = 3 ((7 + 2)p3 + (p2 — 1)Apiz + (1 — Ds) Apex) 
and also 
pak (1—-ps)k—-1 1 
Yo % == Wye )(L— pak SO 7 
i=p3k+1 a —p2)k 
= (3 28pye)(1— ps)k(In —"* + o(1)) 
P2 


From these two equations, we can write equation 12 as follows: 


42 p= Ve 
5 pes Bf = G2 26 Jee li eo 


i mie Sees 
gh 3 1—p 


In order to write 16 we need the following: 


pak pak 
2 (y+2)pak—25°6; and 
j=l 
oe pak j pak pak 
2) fe) rer Fi "Ekin 
pak pak mat tp 
(pak — i + 10; — po)kO; 
> l,-i+1 ape aera ,-i+1 
pak k 
> 7 = 3 (7 + 2)p4 + (p2 = pa) pak) 
i=1 


Therefore, Condition 16 can be written as 


Li +2 Pp. —P 
(pz — 3) — (3 — 29 pau) (1 — ps)n +o(1) < 1s +N em 


For 7 < pak, Condition 18 can be written in terms of p,’s and + as follows: 


p3k 6; a] 6; 
a ———_ > 9, - 1 _ >6;-1> 
Sree dais 2% =e Apa NY 
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0; > 20 3k -—1 VW < pak 
In the last inequality, we substituted A; in terms of 0;’s using Equation 20. Thus, it is sufficient to have: 
pak 2 2Opsk — 1 (25) 


Furthermore, for pak < j < p3k 


5 ara -r;20;-1> 
a -i+1 Sh -i+1-% ee 
6; +Opor 22  Vpsk <j < psk. 
Again, in the last step we used the Equation 20. Thus, it is sufficient to have: 
Onsk + Opok > 2 (26) 


The last observation here is that 


k 
0; 
Bpak $1 C= Da = nak = 1-14 Ora $7 
i=1 * 


Thus, from Inequality 22, we get ¢ < y and it is sufficient to minimize y instead of max(¢, 7). Now, we need 
to find pa < py < p3 < pe such that Inequalities 22, 23, 24, 25 and 26 are all satisfied and y is minimum. 
Notice that all these recent inequalities are in terms of p;’s and 7 (because 6;’s have been written in terms 
of them as well as \;’s). Now we can observe that by setting p, = 0.327, po = 0.737, p3 = 0.539, and 
pa = 0.327, all inequalities are satisfied and y < 1.939 < 1.94 as desired. 
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